Robust Dialogue State Tracking with Weak Supervision and Sparse Data
نویسندگان
چکیده
Abstract Generalizing dialogue state tracking (DST) to new data is especially challenging due the strong reliance on abundant and fine-grained supervision during training. Sample sparsity, distributional shift, occurrence of concepts topics frequently lead severe performance degradation inference. In this paper we propose a training strategy build extractive DST models without need for manual span labels. Two novel input-level dropout methods mitigate negative impact sample sparsity. We model architecture with unified encoder that supports value as well slot independence by leveraging attention mechanism. combine strengths triple copy matching benefit from complementary predictions violating principle ontology independence. Our experiments demonstrate an can be trained strategies improve robustness towards concepts, topics, leading state-of-the-art range benchmarks. further highlight our model’s ability effectively learn non-dialogue data.
منابع مشابه
Robust Estimation in Linear Regression with Molticollinearity and Sparse Models
One of the factors affecting the statistical analysis of the data is the presence of outliers. The methods which are not affected by the outliers are called robust methods. Robust regression methods are robust estimation methods of regression model parameters in the presence of outliers. Besides outliers, the linear dependency of regressor variables, which is called multicollinearity...
متن کاملReal-Time Robust Tracking with Sparse Representation
Real-Time Robust Tracking with Sparse Representation Visual object tracking plays a critical role in many computer vision applications including visual surveillance, transportation monitoring system etc. A main goal of visual object tracking is to estimate the location of specified object in consecutive frames. For these applications, we need a real time tracker that is robust to various challe...
متن کاملNeural Belief Tracker: Data-Driven Dialogue State Tracking
One of the core components of modern spoken dialogue systems is the belief tracker, which estimates the user’s goal at every step of the dialogue. However, most current approaches have difficulty scaling to larger, more complex dialogue domains. This is due to their dependency on either: a) Spoken Language Understanding models that require large amounts of annotated training data; or b) hand-cr...
متن کاملRobust State Estimation with Sparse Outliers
One of the major challenges for state estimation algorithms, such as the Kalman lter, is the impact of outliers that do not match the assumed Gaussian process and measurement noise. When these errors occur they can induce large state estimate errors and even lter divergence. This paper presents a robust recursive ltering algorithm, the l1-norm lter, that can provide reliable state estimates in ...
متن کاملSnorkel: Rapid Training Data Creation with Weak Supervision
Labeling training data is increasingly the largest bottleneck in deploying machine learning systems. We present Snorkel, a first-of-its-kind system that enables users to train stateof-the-art models without hand labeling any training data. Instead, users write labeling functions that express arbitrary heuristics, which can have unknown accuracies and correlations. Snorkel denoises their outputs...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Transactions of the Association for Computational Linguistics
سال: 2022
ISSN: ['2307-387X']
DOI: https://doi.org/10.1162/tacl_a_00513